NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Joint Selection: Adaptively Incorporating Public Information for Private Synthetic Data

Fuentes, Miguel; Mullins, Brett C; McKenna, Ryan; Miklau, Gerome; Sheldon, Daniel (May 2024, Proceedings of Machine Learning Research)

Full Text Available
Joint Selection: Adaptively Incorporating Public Information for Private Synthetic Data

Fuentes, Miguel; Mullins, Brett C; McKenna, Ryan; Miklau, Gerome; Sheldon, Daniel (May 2024, International Conference on Artificial Intelligence and Statistics (AISTATS))

Full Text Available
Measure-Observe-Remeasure: An Interactive Paradigm for Differentially-Private Exploratory Analysis

Nanayakkara; Priyanka; Kim, Hyeok; Wu, Yifan; Sarvghad, Ali; Mahyar, Narges; Miklau, Gerome; Hullman, Jessica (May 2024, IEEE Symposium on Security and Privacy)

Full Text Available
AIM: an adaptive and iterative mechanism for differentially private synthetic data

https://doi.org/10.14778/3551793.3551817

McKenna, Ryan; Mullins, Brett; Sheldon, Daniel; Miklau, Gerome (July 2022, Proceedings of the VLDB Endowment)

We propose AIM, a new algorithm for differentially private synthetic data generation. AIM is a workload-adaptive algorithm within the paradigm of algorithms that first selects a set of queries, then privately measures those queries, and finally generates synthetic data from the noisy measurements. It uses a set of innovative features to iteratively select the most useful measurements, reflecting both their relevance to the workload and their value in approximating the input data. We also provide analytic expressions to bound per-query error with high probability which can be used to construct confidence intervals and inform users about the accuracy of generated data. We show empirically that AIM consistently outperforms a wide variety of existing mechanisms across a variety of experimental settings.
more » « less
Full Text Available
Winning the NIST Contest: A scalable and general approach to differentially private synthetic data

https://doi.org/10.29012/jpc.778

McKenna, Ryan; Miklau, Gerome; Sheldon, Daniel (December 2021, Journal of Privacy and Confidentiality)

We propose a general approach for differentially private synthetic data generation, that consists of three steps: (1) select a collection of low-dimensional marginals, (2) measure those marginals with a noise addition mechanism, and (3) generate synthetic data that preserves the measured marginals well. Central to this approach is Private-PGM, a post-processing method that is used to estimate a high-dimensional data distribution from noisy measurements of its marginals. We present two mechanisms, NIST-MST and MST, that are instances of this general approach. NIST-MST was the winning mechanism in the 2018 NIST differential privacy synthetic data competition, and MST is a new mechanism that can work in more general settings, while still performing comparably to NIST-MST. We believe our general approach should be of broad interest, and can be adopted in future mechanisms for synthetic data generation.
more » « less
Full Text Available
Relaxed Marginal Consistency for Differentially Private Query Answering

McKenna, Ryan; Pradhan, Siddhant; Sheldon, Daniel; Miklau, Gerome (January 2021, Advances in Neural Information Processing Systems (NeurIPS))

Many differentially private algorithms for answering database queries involve a step that reconstructs a discrete data distribution from noisy measurements. This provides consistent query answers and reduces error, but often requires space that grows exponentially with dimension. Private-PGM is a recent approach that uses graphical models to represent the data distribution, with complexity proportional to that of exact marginal inference in a graphical model with structure determined by the co-occurrence of variables in the noisy measurements. Private-PGM is highly scalable for sparse measurements, but may fail to run in high dimensions with dense measurements. We overcome the main scalability limitation of Private-PGM through a principled approach that relaxes consistency constraints in the estimation objective. Our new approach works with many existing private query answering algorithms and improves scalability or accuracy with no privacy cost.
more » « less
Full Text Available
Investigating Visual Analysis of Differentially Private Data

Zhang, Dan; Sarvghad, Ali; Miklau, Gerome (January 2020, IEEE transactions on visualization and computer graphics)
null (Ed.)
Full Text Available
A workload-adaptive mechanism for linear queries under local differential privacy

https://doi.org/10.14778/3407790.3407798

McKenna, Ryan; Maity, Raj Kumar; Mazumdar, Arya; Miklau, Gerome (January 2020, VLDB)
null (Ed.)
Full Text Available
Graphical-model based estimation and inference for differential privacy

McKenna, Ryan; Sheldon, Daniel; Miklau, Gerome (June 2019, Proceedings of Machine Learning Research)
null (Ed.)
Many privacy mechanisms reveal high-level information about a data distribution through noisy measurements. It is common to use this information to estimate the answers to new queries. In this work, we provide an approach to solve this estimation problem efficiently using graphical models, which is particularly effective when the distribution is high-dimensional but the measurements are over low-dimensional marginals. We show that our approach is far more efficient than existing estimation techniques from the privacy literature and that it can improve the accuracy and scalability of many state-of-the-art mechanisms.
more » « less
Full Text Available
MithraRanking: A System for Responsible Ranking Design

https://doi.org/10.1145/3299869.3320244

Guan, Yifan; Asudeh, Abolfazl; Mayuram, Pranav; Jagadish, H. V.; Stoyanovich, Julia; Miklau, Gerome; Das, Gautam (July 2019, Proc. ACM SIGMOD Intl Conf on Management of Data)

Items from a database are often ranked based on a combination of criteria. The weight given to each criterion in the combination can greatly affect the ranking produced. Often, a user may have a general sense of the relative importance of the different criteria, but beyond this may have the flexibility, within limits, to choose combinations that weigh these criteria differently with an acceptable region. We demonstrate MithraRanking, a system that helps users choose criterion weights that lead to “better” rankings in terms of having desirable properties while remaining within the acceptable region. The goodness properties we focus on are stability and fairness.
more » « less
Full Text Available

« Prev Next »

Search for: All records